Extending the Entity-based Coherence Model with Multiple Ranks

نویسندگان

  • Vanessa Wei Feng
  • Graeme Hirst
چکیده

We extend the original entity-based coherence model (Barzilay and Lapata, 2008) by learning from more fine-grained coherence preferences in training data. We associate multiple ranks with the set of permutations originating from the same source document, as opposed to the original pairwise rankings. We also study the effect of the permutations used in training, and the effect of the coreference component used in entity extraction. With no additional manual annotations required, our extended model is able to outperform the original model on two tasks: sentence ordering and summary coherence rating.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Optimal Approach to Local and Global Text Coherence Evaluation Combining Entity-based, Graph-based and Entropy-based Approaches

Text coherence evaluation becomes a vital and lovely task in Natural Language Processing subfields, such as text summarization, question answering, text generation and machine translation. Existing methods like entity-based and graph-based models are engaging with nouns and noun phrases change role in sequential sentences within short part of a text. They even have limitations in global coheren...

متن کامل

Extending the Entity-grid Coherence Model to Semantically Related Entities

This paper reports on work in progress on extending the entity-based approach on measuring coherence (Barzilay & Lapata, 2005; Lapata & Barzilay, 2005) from coreference to semantic relatedness. We use a corpus of manually annotated German newspaper text (TüBa-D/Z) and aim at improving the performance by grouping related entities with the WikiRelate! API (Strube & Ponzetto, 2006).

متن کامل

Non Secretory Multiple Myeloma With HCV Infection: A Rare Case Entity

Multiple Myeloma is a neoplasm of B cell lineage characterized by excessive proliferation of abnormal plasma cells. It is characterized by a clinical  pentad of 1) anemia, 2) a monoclonal protein in the serum or the urine or both, 3) bone leisons and or bone pain, 4) hypercalcemia >11.5g/dl and 5) renal insufficiency. Non secretory multiple myeloma is a rare variant of the classic form of multi...

متن کامل

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...

متن کامل

The Mediating Role of Sense of Coherence in the Relationship between Perceived Stress with Fatigue and Pain in Multiple Sclerosis Patients

Background and Objectives: Fatigue and pain are the common complications in multiple sclerosis patients, which is influenced by the patients’ psychology as well as stress. The current study aimed at investigating protective mediating role of sense of coherence in the relationship between perceived stress and fatigue/pain in Iranian MS patients. Methods: This cross-sectional study was carried ou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012